首页> 外文OA文献 >Acoustic Reflector Localization: Novel Image Source Reversion and Direct Localization Methods
【2h】

Acoustic Reflector Localization: Novel Image Source Reversion and Direct Localization Methods

机译:声学反射器定位:新型图像源回归和直接   本地化方法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Acoustic reflector localization is an important issue in audio signalprocessing, with direct applications in spatial audio, scene reconstruction,and source separation. Several methods have recently been proposed to estimatethe 3D positions of acoustic reflectors given room impulse responses (RIRs). Inthis article, we categorize these methods as "image-source reversion", whichlocalizes the image source before finding the reflector position, and "directlocalization", which localizes the reflector without intermediate steps. Wepresent five new contributions. First, an onset detector, called the clustereddynamic programming projected phase-slope algorithm, is proposed toautomatically extract the time of arrival for early reflections within the RIRsof a compact microphone array. Second, we propose an image-source reversionmethod that uses the RIRs from a single loudspeaker. It is constructed bycombining an image source locator (the image source direction and range (ISDAR)algorithm), and a reflector locator (using the loudspeaker-image bisection(LIB) algorithm). Third, two variants of it, exploiting multiple loudspeakers,are proposed. Fourth, we present a direct localization method, the ellipsoidtangent sample consensus (ETSAC), exploiting ellipsoid properties to localizethe reflector. Finally, systematic experiments on simulated and measured RIRsare presented, comparing the proposed methods with the state-of-the-art. ETSACgenerates errors lower than the alternative methods compared through ourdatasets. Nevertheless, the ISDAR-LIB combination performs well and has a runtime 200 times faster than ETSAC.
机译:声反射器的定位是音频信号处理中的重要问题,它直接应用于空间音频,场景重建和源分离。最近已经提出了几种方法来估计在给定房间脉冲响应(RIR)的情况下声反射器的3D位置。在本文中,我们将这些方法归类为“图像源反转”和“直接定位”,其中“图像源反转”在找到反射器位置之前先对图像源进行本地化,而“中间定位”则在没有中间步骤的情况下对反射器进行定位。我们提出了五项新的贡献。首先,提出了一种称为聚类动态规划投影相位斜率算法的启动检测器,以自动提取紧凑麦克风阵列的RIR中早期反射的到达时间。其次,我们提出一种图像源还原方法,该方法使用来自单个扬声器的RIR。它是通过组合图像源定位器(图像源方向和范围(ISDAR)算法)和反射器定位器(使用扬声器图像对分(LIB)算法)而构造的。第三,提出了利用两个扬声器的两个变体。第四,我们提出了一种直接的定位方法,即椭球正切样本一致性(ETSAC),它利用椭球的性质来定位反射器。最后,对模拟和测量的RIR进行了系统的实验,将所提出的方法与最新技术进行了比较。与通过我们的数据集相比,ETSAC生成的错误比其他方法要低。但是,ISDAR-LIB组合的性能良好,运​​行时间比ETSAC快200倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号